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1. Introduction 

Cloning and overexpression of recombinant proteins in host cells make available 
large amounts of purified proteins, natural or modified by protein engineering 
techniques to improve their cataUtic performances. Tins approach is especially 
suitable to purify isoforms of Candida mgosa lipase (CRL) that differ m 
biochemical and possibly in cataluic properties but are still similar enough to be 
onlv laboriously separated, as described in another paper of this book. 

We have cloned a large family of lipase-encoding genes that are likely 
expressed by the yeast in inequal amounts and under different growth conditions 
[1.2]. CRL isozymes are similar in molecular weight and sequence (about 80% 
patnvise identity in the ammo acid sequences) but differ in pi and glycosylation 
degree; variability in catalytic activity is suggested by experimental evidence 
[3] The product of the LIP1 gene is likely to be the major enz\nie form and its 
sequence and three-dimensional structure have been elucidated [L4]. 

For these reasons wc decided to express LIP1 in Saccharomyces cerevisiae. 
In this attempt we had to face unexpected issues concerning i) the 
translabihtv/stabiiitv of the recombinant protein in the host cells and u) the non 
universal genetic code utilized by Candida cells These topics are the subject of 
this paper 

2. Heterologous expression of LIP1 lipase 

.V cerevisiae was the host selected for expression of cloned lipase I. In our 
laboratorx .V cerevisiae is currently used for expressing rDNA proteins from 
bacteria. \ easts, plant and mammals [5] at levels as high as 15% of total cell 
proteins In the pharmaceutical industry, this organism is widely employed for 
the production of proteins like hepatitis B vaccine and human msulme Like 
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mammalian cells, yeast is able to cam out several post-translational 
modifications but it is as easy to handle as are bacterial cells Information about 
protein processing in Candida is very poor, but it is known for example, that the 
lipase proteins are N-glycosylatcd: thus a Candida protein should be correctly 
glycosvlated by Saccharomyces cells This point seems to be of importance also 
for lipase function, since a role of the sugar mojety on the stabilization of the 
CRL active conformation has been observed [4]. 

As the expression vector we used the E. coli-S. cerevisiae shuttle plasmid 
pEMBLYe\4 [6] that carries the upstream activating sequence UASGal isolated 
from yeast GAL1-GALW genes. Expression of any gene cloned downstream of 
UASGal is induced up to 1000 fold by the presence of galactose in the culture 
medium whereas it is repressed by glucose through a mechanism known as 
catabolite repression [7] 

The amino acid sequence of the UP J product is shown in Fig 1 it is 
consistent with a protein of 549 residues, the first 15 of them building a 
canonical leader sequence for secretion, a stretch of hydrophobic amino acids 
that targets the growing peptide to the secretory pathway. Sequencing of the 
mature protein N -terminus showed that this signal peptide is cut in Candida 
after the tnpeptide VAA [8] The mature protein contains 3 potential sites for 
N-glvcosvlation in the correspondence of the N-X-T/Y consensus sequence at 
asparagine 29 L 314 and 35 1 
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Figure 1. Amino acid sequence of lipase I The arrowhead marks the sue for signal 
peptidase cleavage Glycos\ laiton sues arc underlined. 
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The first step towards heterologous expression was the modification by 
polymerase chain reaction of the 5'-end of LIPl in order to add an ATG start 
codon, since the original triplet had been removed during cloning [1]. The 
complete LIPl open reading frame was inserted in pEMBLYex4 In the 
resulting plasmid LIPl is put under the control of the UASGal regulation 
sequence: its expression is modulated through the carbon source present in the 
culture medium, i.e. repressed by glucose and induced by galactose. 

S. cerevisiae cells transformed with the recombinant plasmid were induced 
by addition of galactose and analysed for both the production of specific niRNA 
and of recombinant protein Surprisingly, and despite the presence of abundant 
mRNA as revealed by Northern blot analysis, no trace of the recombinant 
protein could be found neither in the cells nor in the culture medium (Fig. 2) 

A different expression vector was obtained by replacing the original signal 
peptide (amino acids 1-15 of LIPl) with the leader sequence derived from the 
killer toxin (Kl) from another yeast. Khnveromyces lactis. This sequence 
prov ed to be verv efficient in the expression/secretion of foreign protein such as 
human mterleukm 1. as described in [9]. 



1 2 3 



1234 56 78 9 




Figure 2 Western blotting of cell extracts of 5". cerevisiae cells transformed with 
inducible plasmids containing LIPl proceeded cither by its lender sequence (top) or by 
thnt derived from the K. lactis killer toxin (bottom). Top (homologous lender 
sequence ) cell extracts from cells grown either in glucose (2) or in galactose minimal 
medium (3). Lane I contain I Mg of purified lipase Bottom (heterologous I s ): cells 
grown cither in glucose (2) or in galactose medium to the exponential or stationary 
phase of growth (3 and 4 respectively). (5-9) Production of recombinant lipase in 
biorcactor under repression (5) or induction conditions (6-9) Lane I contain 1 ng of 
purified lipase Exlnicts were obtained from M> Cl ceils 
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When yeast cells transformed with the chimeric plasmid were induced by 
galactose, high levels of specific mRNA were detected by Northern blot analysis 
and recombinant lipase I was accumulated within the host cells (Fig. 2). The 
enz\-me was s\*nthesized over several hours during the exponential and the 
stationary phases of growth. Its level was evaluated to be about 10-20 mg/liter 
of culture, a production comparable to that obtained in our laboratory with 
similar expression systems using batch fermentation. 

The process was scaled up in a two-stage computer-controlled fermentation 
system [10]. Fed batch fermentation was performed in a 2 1 aereated, stirred- 
tank bioreactor equipped with temperature, agitation, air flow rate, dissolved 
oxygen. pH and ethanol controllers. The control system was based on the 
determination of the ethanol concentration in the bioreactor outflow gases, 
allowing to regulate the supply rate of nutrients to the culture so as to prevent 
inhibition of cell growth. In this way. a high yield of conversion of glucose to 
biomass (0.48 g/g) and an elevated volumetric production and productivity can 
be obtained (up to 100 g of dry weigth per liter) [10]. 

To induce LIP I expression, a mixed substrate was added in the feed: the 
process was started with 50% w/v glucose, induction was obtained by changing 
the carbon source to 25%glucosc-25%galactose after 40 hours and to 50% 
galactose after 80 hours. Fermentation was stopped after 115 hours when cell 
density in the bioreactor was 10 9 cells/ml. Up to 1 g/1 of recombinant lipase I 
was obtained and the protein was accumulated over several hours of 
fermentation (Fig. 2). 

In conclusion, these experiments suggest that the lipase natural leader 
sequence, although very efficient in Candida cells, is not suited for expression in 
S cerevisiae. S. cerevisiae cells are usually quite permissive in the recognition 
of foreign signals and several eukaryotic proteins preceeded by their own signals 
have been successfully expressed and secreted . However, a correct processing 
of the recombinant protein cannot be assumed a prion, since for example, 
expression of interferon preceeded by its natural leader sequence resulted in 
protein degradation [II]. Whereas the role of the leader sequence in secretion is 
well established since several years, its effect on protein translability/stability is 
still uncertain, also because it seems to depend on the protein itself 

To investigate the fate of the recombinant protein within the yeast cells, cell 
extracts were subjected to treatment with endoglycosidase F to remove 
carbohydrates. A decrease in the molecular weight of the recombinant protein 
was evidenced in Western blots (Fig 3). showing that lipase had been correctly 
targeted to the secretory pathway, since glycosylation first occurs in the 
endoplasmic reticulum The recombinant lipase however, was retained in the cell 
membrane and was not released neither in the culture medium nor in the 
periplasms space (Fig 3) This is very likely to depend on a misfoldmg of the 
recombinant protein due to the erroneous introduction of leucine residues in the 



223 



correspondence of CUG codons. a problem deriving from the non universal 
usage of this codon by C rugosa cells, as discussed in the following. 



1 2 3 4 5 6 7 




Figure 3 Transport of rLIPl in yeast cells Extracts were obtained from I0 7 cells 
exponentially growing in galactose (induction) medium and subjected to Western 
blotting (2) untreated sample; O) after deglycosylation with endoglycosidase F. (4-7) 
separation of cells fraction: membrane fraction of cells grown in glucose (4) or 
galactose (5-6) and cMoplasniatic fraction of cells induced by galactose. The control 
purified protein is shown in 1. 

2. Mutagenesis of CUG codons 

In 1989 Kawaguchi and colleagues unexpectedly reported that codon CUG was 
read as serine instead of leucine by C. rugosa. This finding was obtained by 
comparing the content of serine and leucine predicted from the nucleotide 
sequence of L1P1 cDNA and that experimentally determined [8]. This was the 
first example of a deviation from the universal genetic code in nuclear genes, 
since previously this phenomenon was believed to be restricted to mithocondrial 
genomes [12]. Shortly later. CUG-Ser was found also in other Candida species, 
namely C. albicans. C. parapsilosis. C mehbwsica. C. zeylanoides and C 
maltosa [13,14]. 

This might seriously hinder the expression of Candida genes in any host 
organism, since the translational machinery of the host cells would introduce 
leucine residues at any CUG codon. where a serine was present in the original 
protein This proved to be not crucial for several Candida proteins that were 
cloned by complementation of Saccharomyccs mutations, as for example the 
orotidine 5'- phosphate decarboxylase from C mahosa [15] All these genes 
carried only a few CUG codons. apparently at positions where a substitution 
can be accepted. 
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In C. rugosa lipases however. CUG codons are very frequent and even the 
catalytic serine is CUG-encoded (Fig. 4). Thus, an extensive mutagenesis 
program might be necessary to obtain the expression of a functional enzyme. In 
the follow ing are presented preliminary results of this study that was performed 
w ith two main purposes: the production of a functional recombinant lipase and 
the investigation of basic issues on the molecular evolution of CRL and the 
origin of the deviation from the universal genetic code. 

CRL genes appear to be unique within the Candida genus for their high 
usage of the codon CUG (see table 1) that is employed for 40% of all serine 
codons. in contrast to several other Candida species, where CUG account for 
only 2-3% of total serine codons (G. Pesole et al., unpublished results). 

Table I - Frequencies of codons for serine and leucine in C rugosa lipases^ 1 ) 



Leucine 





LIP1 


LIP2 


LIP3 


L1P4 


LIP5 


CUG 


0.42 


0.38 


0.41 


0.36 


0.37 


AGU 


0.02 


0.09 


0.09 


0.04 


0.06 


AGC 


0.25 


0.29 


0.28 


0.29 


0.28 


UCG 


0.17 


0.13 


0.13 


0 16 


0.1 1 


UCA 


0.02 


0.02 


0.00 


0.02 


0.00 


UCU 


0.00 


0.02 


0.02 


0.02 


0.02 


UCC 


0.1 1 


0.07 


0.06 


0.08 


0 15 


UUG 


0.44 


0.43 


0.47 


0.44 


0.42 


UUA 


0.00 


0.00 


0.00 


0.00 


0.00 


CUA 


0.00 


0.00 


0.00 


0.00 


0.00 


cuu 


0.08 


0.2! 


0.10 


0.15 


0.15 


cue 


0.48 


0.36 


0.43 


0.40 


0.43 



( * ) Values arc given as percentages of total codons for each ammo acid 
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In the LIP J protein. 19 serines (enclosed the cataKlic serine 209) are CUG- 
sennes. By comparing the amino acid sequences of the five CRL enz>-mes 
sequenced to date, one can observe that 13 CUG serines are conserved in all 
isoforrns (Fig. 4). This might be suggestive of their importance in CRL structure. 
A broader analysis earned out with enzymes belonging to the esterase/lipase 
family which includes the C rugoso and G candidum lipases together with 
esterases and acetylcholinesterases [16], shows that at least three of these 
serines: S209, S241 and S282 are conserved also in closely related enzymes. 
The structural role of these residues is confirmed by the inspection of their 
position in the enz\me three-dimensional structure Serine 241 for example, 
interacts with two catalytic residues, S209 and E341. These codons were 
therefore mutated by polymerase chain reaction to universal serine TCC triplets 
and reintroduced in the LIP1 gene cloned in the expression plasmid described 
above. A high level of expression was obtained in experimental conditions very 
similar to those described in the previous paragraph but the protein was 
accumulated within the cells in an inactive form. 




Figure 4 CA trace of CRL1 with the conserved serines and ihe lid in bold. The figure 
was generated by Turbo-Frodo [20] 
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A second group of target residues was identified taking into account their 
structural role according to the hydrogen bonds and snlt bridges pattern and 
mutagenized Once more, the protein was retained in the endoplasmic reticulum, 
although serines important for the structure of the protein core had been 
restaured 

These results seem to suggest that recovery of the inner core conformation is 
not sufficient to obtain a functional enzyme ) a j"j 

Some intriguing questions arise from the study of C. rugosa lipases, in 
particular how could such a deviation from the universal genetic code originate 
and be maintained in the lipase enzymes where it concerns c\en the catalytic 
residue. 

By aligning the sequences of the small nbosomal subunits rRNAs of all 
known Candida species and other related yeast [17]. we could observe that the 
genus Candida includes a monopln letic subgroup, several members of which ^ 
have been shown by Ohama and colleagues [13] to use Cl'G as a codon for ver ^ 
serine. Interestingly, other members of this group only rarely employ CUG ( ] ets 
which is the major serine codon in C rugosa. This finding can be partly k ec j 
explained by the high OG content of the C. rugosa genome and by the . 
existence of multiple genes for tRNA(CAG)[ 1 8]. but docs not satisfactory ^ 
explain how a non-functional mutation could arise and be maintained in an 
enzyme active site. In fact, we should hypothesize that a non-functional lipase 
originated following CUG unassigiiment. One can hypothesize that either codon 
change was made possible by the presence of multiple copies of lipase genes so 
that a deleterious mutation would not affect the viability of the cell, or that 
lipase was a dispensable function. We have shown that (\ rugosa possess 
multiple copies of the lipase genes and it is a diploid organism 1 19]. Thus it w as 
possible for non functional copies of this gene to be maintained within the 
genome and to assume their functional role after codon reassignment. 

Both the high number of CUG serines and their conservation in the CRT. 
gene family- suggest the action of a strong selective pressure on the 
assignment/usage of this codon. 
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